Automatic Segmentation of Manipuri (Meiteilon) Word into Syllabic Units
نویسندگان
چکیده
The work of automatic segmentation of a Manipuri language (or Meiteilon) word into syllabic units is demonstrated in this paper. This language is a scheduled Indian language of TibetoBurman origin, which is also a very highly agglutinative language. This language usages two script: a Bengali script and Meitei Mayek (Script). The present work is based on the second script. An algorithm is designed so as to identify mainly the syllables of Manipuri origin word. The result of the algorithm shows a Recall of 74.77, Precision of 91.21 and F-Score of 82.18 which is a reasonable score with the first attempt of such kind for this language.
منابع مشابه
Early Syllabic Segmentation of Fluent Speech by Infants Acquiring French
Word form segmentation abilities emerge during the first year of life, and it has been proposed that infants initially rely on two types of cues to extract words from fluent speech: Transitional Probabilities (TPs) and rhythmic units. The main goal of the present study was to use the behavioral method of the Headturn Preference Procedure (HPP) to investigate again rhythmic segmentation of sylla...
متن کاملUnsupervised word discovery from speech using automatic segmentation into syllable-like units
This paper presents a syllable-based approach to unsupervised pattern discovery from speech. By first segmenting speech into syllable-like units, the system is able to limit potential word onsets and offsets to a finite number of candidate locations. These syllable tokens are then described using a set of features and clustered into a finite number of syllable classes. Finally, recurring syllab...
متن کاملAutomatic Segmentation of Wave File
This paper presents an ASS (Automatic Speech Segmentation) Technique to segment spontaneous speech into syllable like units. In the development of a syllable-centric ASS system, segmentation of the acoustic signal into syllabic units is an important stage. In this paper we focus on the identifying minimum unit of speech to be considered while training any speech recognition system. There are sy...
متن کاملAutomatic Syllabification for Manipuri language
Development of hand crafted rule for syllabifying words of a language is an expensive task. This paper proposes several data-driven methods for automatic syllabification of words written in Manipuri language. Manipuri is one of the scheduled Indian languages. First, we propose a language-independent rule-based approach formulated using entropy based phonotactic segmentation. Second, we project ...
متن کاملProsodic parsing for Swedish speech recognition
A prosodic parsing system is described which uses tonal, intensity and durational information to recognize prosodic categories. An automatic segmentation algorithm first divides the utterance into "tonal segments" which roughly correspond to syllabic units. A vowel intensity threshold routine then sorts out probable unstressed syllables. Recognition rules are applied to the remaining syllables....
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1207.3932 شماره
صفحات -
تاریخ انتشار 2012